# Multi-speaker Support
Csm 1b
Apache-2.0
CSM (Conversational Speech Model) is a 1B-parameter speech generation model developed by Sesame, capable of generating RVQ audio encoding from text and audio inputs.
Speech Synthesis English
C
unsloth
2,667
5
Csm 1b Safetensors Fp16
Apache-2.0
CSM (Conversational Speech Model) is a 1-billion-parameter speech generation model developed by Sesame, capable of generating RVQ audio encoding from text and audio inputs.
Speech Synthesis
Transformers English

C
lunahr
79
5
Csm 1b
Apache-2.0
CSM is a 1B-parameter speech generation model developed by Sesame, capable of generating RVQ audio codes from text and audio inputs, supporting context-aware speech generation.
Speech Synthesis English
C
eustlb
5,144
3
Csm 1b Safetensors Quants
Apache-2.0
CSM (Conversational Speech Model) is a 1-billion-parameter speech generation model developed by Sesame, capable of generating RVQ audio encoding from text and audio inputs.
Speech Synthesis
Transformers English

C
lunahr
37
7
Csm 1b
Apache-2.0
A PyTorch-based text-to-speech model supporting Chinese speech synthesis, developed and released by SesameAILabs.
Speech Synthesis
C
nielsr
18
3
Yourtts Formosan Only Ithuan
Experimental speech synthesis model based on Amis and Taroko languages, trained using the ithuan dataset
Speech Synthesis Other
Y
united-link
14
0
F5 TTS Pt Br
Brazilian Portuguese text-to-speech model based on F5-TTS, supporting emotion tags and speaker feature control
Speech Synthesis Other
F
firstpixel
253
36
Parler Tts Mini V1.1
Apache-2.0
Parler-TTS Mini v1.1 is a lightweight text-to-speech model trained on 45,000 hours of audio data, capable of generating high-quality, natural-sounding speech with controllable features through simple text prompts.
Speech Synthesis
Transformers English

P
parler-tts
1,490
19
Speecht5 Tts Tr V1.0
MIT
A Turkish text-to-speech model fine-tuned from Microsoft SpeechT5, capable of generating natural speech
Speech Synthesis
Transformers Other

S
umarigan
959
8
Parler Tts Tiny V1
Apache-2.0
Lightweight text-to-speech model trained on 45,000 hours of audio data, capable of controlling voice attributes through text prompts
Speech Synthesis
Transformers English

P
parler-tts
67
1
Parler Tts Mini Expresso
Apache-2.0
Parler-TTS Mini: Expresso is a lightweight text-to-speech model fine-tuned on the Expresso dataset based on Parler-TTS Mini v0.1, supporting emotion and speaker control.
Speech Synthesis
Transformers English

P
parler-tts
1,489
107
Speecht5 Finetuned Facebook Voxpopuli French
MIT
A text-to-speech model fine-tuned on the VoxPopuli French dataset based on microsoft/speecht5_tts
Speech Synthesis
Transformers

S
Sandiago21
71
2
Featured Recommended AI Models